Abstract: Big data analytics is the process of examining big data to uncover hidden patterns, unknown correlations and other useful information that can be used to make better decisions. The main goal of this project is to understand and implement the entire process of data mining and analytics. We will be extracting the information from data sources by implementing a web crawler. To remove the inconsistencies in the extracted data we will be cleaning it. The cleaned data will be migrated to database, analyzed and visualized.

Keywords: Web crawler, open refine, visualization, analytics.